CDS

Accession Number TCMCG064C09464
gbkey CDS
Protein Id XP_011075521.1
Location complement(join(4100110..4100122,4100480..4100544,4100661..4100924,4101110..4101364,4101450..4101845,4102219..4102785))
Gene LOC105159982
GeneID 105159982
Organism Sesamum indicum
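
The Location field above can be checked against the stated protein length. The following is a minimal sketch, assuming the field follows the usual GenBank-style `complement(join(start..end,...))` syntax with 1-based inclusive coordinates. The helper names `parse_location` and `spliced_length` are hypothetical, and single-base or partial (`<`, `>`) positions are not handled.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(4100110..4100122,4100480..4100544,4100661..4100924,"
    "4101110..4101364,4101450..4101845,4102219..4102785))"
)

def parse_location(location):
    """Return (strand, segments) for a GenBank-style location string.

    Assumes every segment is written as start..end (1-based, inclusive).
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def spliced_length(segments):
    """Total number of bases covered by all segments."""
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location(LOCATION)
cds_length = spliced_length(segments)

print(strand, len(segments), cds_length)
```

Under these assumptions the six segments sum to 1560 bases on the minus strand, which equals 3 x (519 + 1): 519 codons for the protein plus one stop codon.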

Protein

Length 519aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_011077219.2
Definition KH domain-containing protein HEN4 [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category A
Description KH domain-containing protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03019
KEGG_ko ko:K21444
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGCCAGCGAAGTGATTACGGAAAAGGGTCGCAGGTACTGTCTGAATATTCTGGAAATGGAGGGAAAAGAAGAAATGTGAATGATGAAAAAGAACAAAACTCTATTGGACTGGATCATACGGTTTACCGTTATTTGTGCCCTTTGAGAAAAATTGGGAGCATTATTGGTATTGGTGGTGATATTGCTAAGCAGTTGAGGGCCCAGACTAATGCGAGGATTAGGATTAGTGAAACTATTCCGGGGTGTGACGAACGCGTTATCACCATTTATAGCACAAGTCATGAAACCAACATTTATGGAAATGAACACATTTCTCCTGCACAGGATGCGCTGTTTAGGGTGCATGATAGGGTGGTGGCTGAAGAACCGCCACTGAATGGTCCGTTCGAGGAACCTCAGCAAGTTACTCTGCGCTTGCTTGTCCCATCAGATCAGATAGGTTGTGTGATCGGTAAAGGGGGGCAAATAATTCAGAACATACGCAATGAGACTCATGCTCAGATCAGAATTTTGGGCAGCGATCATTTACCACCTTGTGCTCTGAGTTCTGATGAACTTATCCAGATAAATGGGGAAGCCACTGTTGTGAAGAAAGCTCTTTATGAAGTCGCGTTTCGTCTTCATGAAAACCCGTCGCGGTCACAGCAATCACTATTGAGTAGTCCAAGCATTTACAGATCTGGAATTACATTTAGTAATCCACATGTAGGTGGACGACCTGTTGGTGTGACCTCATTAATGGGTCCTTATGGGAATTACAAAAACAACGGCAGAGATTGGTCTTTTATGATGAAGGAATTCGCACTTCGTTTAGTTTGTCCAACTGAAAATCTTGGTGCCGTAATAGGCAAAAGTGGTGCCATTGTCAAACAAATAAGACAGGAATCAGGTGCATCTATAATAGTTGATAGTTCTGGTGCTGATGAAGATGACTGTATTATATCCGTCTCTGCTAAGGAGTTGTTTGAAGCTCCTTCTCGGACTATTGATGCGGTGATGCGGTTGCAACCGAGGTGCAGTGGGAAAATGGAAAGAGATTCAGGTGATTCTGTGATCATAACTCGTTTGCTAGTCTCAAGCTCAAGAATTGGATGTATCATTGGTAAAGGTGGGGCAATTATTAAAGAGATGAGGAGTACCTCTAGAGCAAACATTCGTATTTTTTCTGACGAGAGTGTTCCCAAAGTTGCATCTGAAGATGATGAGATGGTCCAGATAACTGGGGATGCGCATGCTGCTAAAAATGCATTGTTACAAGTAATGCAGCGCTTGAGAACCAATGTATTTGAGAATGATGGAAATTCGTCCGCATTTCCTATACCTGCTCAATCTCTTGCAACATCAACAGAGACATTTGGTCAAAAGTATGTAACTCCTGATAACAGAACACGTAATCCAGGATATTCTACTTACTCTGGTGGCTACAGTTCTAAAACCTTGCCTTCAACTGGCAACTATGGGAATTATGACGATTCACAGGTTGTCAGTGAAAGTGCTTATGGAGCATATCCTGTTTATTCAGCTGGTCGCCCTACCGGTTCCAGCTATGCTGCATGA
Protein:  
MGQRSDYGKGSQVLSEYSGNGGKRRNVNDEKEQNSIGLDHTVYRYLCPLRKIGSIIGIGGDIAKQLRAQTNARIRISETIPGCDERVITIYSTSHETNIYGNEHISPAQDALFRVHDRVVAEEPPLNGPFEEPQQVTLRLLVPSDQIGCVIGKGGQIIQNIRNETHAQIRILGSDHLPPCALSSDELIQINGEATVVKKALYEVAFRLHENPSRSQQSLLSSPSIYRSGITFSNPHVGGRPVGVTSLMGPYGNYKNNGRDWSFMMKEFALRLVCPTENLGAVIGKSGAIVKQIRQESGASIIVDSSGADEDDCIISVSAKELFEAPSRTIDAVMRLQPRCSGKMERDSGDSVIITRLLVSSSRIGCIIGKGGAIIKEMRSTSRANIRIFSDESVPKVASEDDEMVQITGDAHAAKNALLQVMQRLRTNVFENDGNSSAFPIPAQSLATSTETFGQKYVTPDNRTRNPGYSTYSGGYSSKTLPSTGNYGNYDDSQVVSESAYGAYPVYSAGRPTGSSYAA
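
The relationship between the CDS and Protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 15 bases of the CDS shown above.
cds_start = "ATGGGCCAGCGAAGTGATTACGGAAAAGGG"
cds_end = "AGCTATGCTGCATGA"

print(translate(cds_start))  # MGQRSDYGKG
print(translate(cds_end))    # SYAA
```

Both fragments agree with the corresponding ends of the Protein sequence (`MGQRSDYGKG...` and `...SYAA`), with the terminal `TGA` read as the stop codon.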